Rate - Optimal Graphon Estimation
نویسندگان
چکیده
Network analysis is becoming one of the most active research areas in statistics. Significant advances have been made recently on developing theories, methodologies and algorithms for analyzing networks. However, there has been little fundamental study on optimal estimation. In this paper, we establish optimal rate of convergence for graphon estimation. For the stochastic block model with k clusters, we show that the optimal rate under the mean squared error is n−1 log k + k/n. The minimax upper bound improves the existing results in literature through a technique of solving a quadratic equation. When k ≤ √ n logn, as the number of the cluster k grows, the minimax rate grows slowly with only a logarithmic order n−1 log k. A key step to establish the lower bound is to construct a novel subset of the parameter space and then apply Fano’s lemma, from which we see a clear distinction of the nonparametric graphon estimation problem from classical nonparametric regression, due to the lack of identifiability of the order of nodes in exchangeable random graph models. As an immediate application, we consider nonparametric graphon estimation in a Hölder class with smoothness α. When the smoothness α ≥ 1, the optimal rate of convergence is n−1 logn, independent of α, while for α ∈ (0, 1), the rate is n 2α α+1 , which is, to our surprise, identical to the classical nonparametric rate.
منابع مشابه
Rates of Convergence of Spectral Methods for Graphon Estimation
This paper studies the problem of estimating the grahpon model – the underlying generating mechanism of a network. Graphon estimation arises in many applications such as predicting missing links in networks and learning user preferences in recommender systems. The graphon model deals with a random graph of n vertices such that each pair of two vertices i and j are connected independently with p...
متن کاملOracle inequalities for network models and sparse graphon estimation
Inhomogeneous random graph models encompass many network models such as stochastic block models and latent position models. We consider the problem of statistical estimation of the matrix of connection probabilities based on the observations of the adjacency matrix of the network. Taking the stochastic block model as an approximation, we construct estimators of network connection probabilities ...
متن کاملOptimal Estimation and Completion of Matrices with Biclustering Structures
Biclustering structures in data matrices were first formalized in a seminal paper by John Hartigan [15] where one seeks to cluster cases and variables simultaneously. Such structures are also prevalent in block modeling of networks. In this paper, we develop a theory for the estimation and completion of matrices with biclustering structures, where the data is a partially observed and noise cont...
متن کاملStochastic blockmodel approximation of a graphon: Theory and consistent estimation
Non-parametric approaches for analyzing network data based on exchangeable graph models (ExGM) have recently gained interest. The key object that defines an ExGM is often referred to as a graphon. This non-parametric perspective on network modeling poses challenging questions on how to make inference on the graphon underlying observed network data. In this paper, we propose a computationally ef...
متن کاملConsistent estimation of exchangeable graph models by Sorting-And-Smoothing (SAS)
The classical results of Aldous, Hoover and Kallenberg showed that exchangeable random arrays can be represented by some measurable functions called graphons. However, consistent estimation of the graphons remains an open issue. In this paper, we first briefly discuss the identifiability issue of the graphon estimation problem. We observe, in particular, that a graphon can be uniquely estimated...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014